Two Tales of the World: Comparison of Widely Used World News Datasets GDELT and EventRegistry

Authors

  • Haewoon Kwak
  • Jisun An
Abstract

In this work, we compare GDELT and Event Registry, two systems that monitor news articles worldwide and provide large-scale data to researchers, in terms of scale, news sources, and news geography. We found significant differences in scale and news sources but, surprisingly, high similarity in news geography between the two datasets.

Similar resources

Understanding News Geography and Major Determinants of Global News Coverage of Disasters

In this work, we reveal the structure of global news coverage of disasters and its determinants by using a large-scale news coverage dataset collected by the GDELT (Global Data on Events, Location, and Tone) project that monitors news media in over 100 languages from the whole world. Significant variables in our hierarchical (mixed-effect) regression model, such as population, political stabili...

Critique of Research Book (Literature)/ A Treasure from Iranian Culture: A Review of Folklore Tales of Lorestan, Zahra Mohammad Hassani Saghiri

Zahra Mohammad Hassani Saghiri, PhD in Persian language and literature, Shahid Chamran University of Ahwaz, and folk literature researcher. Abstract: Folklore Tales of Lorestan is a collection of 4 volumes recently published by the Center of Islamic Great Encyclopedia. The present article reviews th...

Comparative Analysis of GDELT Data Using the News Site Contrast System

The News Site Contrast (NSContrast) system analyzes news articles retrieved from multiple news sites based on the concept of contrast set mining. It can extract terms that characterize different topics of interest across news sites, countries, and regions. In this study, we used NSContrast to analyze Global Database of Events, Language, and Tone (GDELT) data by comparing news articles ...

Visual and Predictive Analytics on Singapore News: Experiments on GDELT, Wikipedia, and ^STI

The open-source Global Database of Events, Language, and Tone (GDELT) is the most comprehensive and updated Big Data source of important terms extracted from international news articles. We focus only on GDELT’s Singapore events to better understand the data quality of its news articles, accuracy of its term extraction, and potential for prediction. To test news completeness and validity, we v...

Revealing the Hidden Patterns of News Photos: Analysis of Millions of News Photos through GDELT and Deep Learning-based Vision APIs

In this work, we analyze more than two million news photos published in January 2016. To the best of our knowledge, this is the first large-scale study of news photos using the GDELT project and deep learning-based vision APIs.

Publication date: 2016